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1 Introduction 



Jet clustering algorithms have become an indispensable tool for the analysis of hadronic 
final states in e + e~ annihilation. More recently they have also started to be applied 
to other types of particle collisions. Clustering algorithms have permitted a wide range 
of important tests of QCD and will be of continuing value in more refined studies and 
in searches for new physics. It therefore remains worthwhile to think of ways in which 
the existing algorithms can be modified to improve their theoretical properties and phe- 
nomenological performance. 

In the present paper we propose improvements which can be applied to any iterative 
clustering algorithm of the basic JADE type [1], although we mainly discuss them with 
reference to the so-called Durham or k± variant of that algorithm [2] . By the JADE type 
of algorithm we mean an exclusive iterative pairwise clustering scheme, in which jets are 
constructed out of primary objects, the latter being hadrons or calorimeter cells in the 
real experimental case and partons in the perturbative theoretical calculation. The term 
exclusive means that each primary object is assigned to a unique jet and each final state 
has a unique jet multiplicity, for a given value of the jet resolution parameter y cut . 

Algorithms of the JADE type have two basic components: 

1) a test variable y^, and 

2) a combination procedure. 

The test variable is used to decide whether the objects % and j should be combined, 
according to whether y^ < y cut . It is also used to choose which objects to consider next 
for combination, namely the pair with the smallest value of y^-. In the original JADE 
algorithm, y^ = M^/Q 2 where Q is the hard scale (i.e. the centre-of-mass or visible 
energy in e + e~ annihilation) and 



which is essentially the invariant mass-squared of the pair. The combination procedure 
specifies the properties of the new object formed by combining i and j, for example that 
its four-momentum should be simply the sum pij = Pi+Pj (the so-called E scheme, which 
we shall adopt in the present paper). The clustering procedure is repeated until no objects 
can be combined further (all y^ > y cut ), at which stage all objects are defined as jets. 

A crucial point is that the identical algorithm should be applicable to real experimental 
data and to the partons that appear in the perturbative calculation. It follows that an 
essential feature of the algorithm must be infrared safety, i.e. insensitivity to the emission 
of arbitrarily soft and/or collinear particles. Otherwise, the presence of massless partons 
in the perturbative calculation would lead to divergent results. Beyond this fundamental 
feature, the ideal algorithm should lead to a close correspondence between the theoretical 
(partonic) and actual (hadronic) jet characteristics and multiplicities after clustering, over 
the widest possible range of values of y cut . That is, it should have the smallest possible 
hadronization corrections, this being the name given to empirical adjustments that are 
applied to the theoretical predictions before comparing them with experiment. One would 
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also like the algorithm to have good theoretical properties, in particular resummability 
of large terms to all orders in perturbation theory and a reduced renormalization-scale 
dependence in fixed order. 

While the original JADE algorithm satisfies the basic criteria of infrared safety, its the- 
oretical properties turn out to be surprisingly complicated. This is because the invariant 
mass is not the most relevant variable for the evolution of QCD jets, and consequently the 
multijet phase space has a non-factorizing structure when expressed in term of the JADE 
test variable [3, 4]. As a result the range of theoretical predictions that can be compared 
with experiment using this algorithm is limited, and the hadronization corrections are 
not optimally small. 

A basic weakness of the JADE algorithm is illustrated by the way it deals with the 
parton-level 'seagull diagram' (Fig. 1), in the phase-space region where the two gluons g% 
and (?4 are soft and almost collinear with the quark and antiquark [3]. We then have 




Figure 1: The 'seagull diagram'. 



J/13 ~ ^3#l 3 , t/24 ~ ^4^24 » 2/34 ~ X 3 X A , (1-2) 

where Xj = lE^jQ. Since all these are of the same order in small quantities, there is an 
important subregion in which 2/34 is the smallest. In that subregion the two gluons will 
be combined first, making a 'phantom' gluon jet with a resultant momentum at a large 
angle to the quark and antiquark, where there are in fact no particles. 

The shortcomings of the JADE algorithm are greatly alleviated by the Durham mod- 
ification [2, 5, 6], which consists of replacing M?- in the definition of the test variable by 
fciy, where 

k 2 ±ij = 2mm{E i ,E j } 2 (l - cos%) , (1.3) 

which is essentially the relative transverse momentum-squared of % and j. 1 This choice of 
test variable reflects the more fundamental role of the transverse momentum in setting 
the scale of jet evolution, as the argument of the running coupling, and in defining the 
boundary between perturbative and non-perturbative physics. In the treatment of the 
seagull diagram, Eqs. (1.2) become 

2/13 ~ x 2 3 ef 3 , y 24 ~ x\el A , y 34 ~ min{^, x\} , (1.4) 

1 Historically, the first algorithm based on a similar variable (LUCLUS) was developed and included in 
the JETSET program by Sjostrand [7]. 
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and so the two gluons cannot be combined first in this region of phase space. As a result, 
one finds that the phase space now has simple factorization properties and the available 
range of predictions is increased. In particular, the terms in the perturbation series that 
involve leading and next-to-leading powers of lny cut can be identified and resummed to 
all orders [5, 6], thereby improving the reliability of the predictions at small values of y cut . 
Furthermore, hadronization corrections, estimated according to the best available Monte 
Carlo models [7, 8], are found to be reduced [9, 10]. 

The good features of the Durham algorithm at low y cut lead one to hope that it should 
be possible to probe the interface between perturbative and non-perturbative QCD by 
studying jet properties as a function of y cut in the relevant region, say y cut < 10~ 4 at 
Q = Mz, corresponding to k± < 1 GeV. One would like, for example, to see whether 
the low-energy behaviour of the effective strong coupling could be studied in this way 
[11, 12]. However, that hope is dashed by the observation [13, 14] that the hadronization 
corrections, although reduced, are still substantial at y cut ~ 10~ 3 , corresponding to k± ~ 3 
GeV, well above the values at which non-perturbative effects would normally be expected. 

In the following Section we discuss the defects of the Durham algorithm which lead 
to this situation, and the steps that can be taken to remedy them. We define modified 
algorithms and study the associated non-perturbative effects that would be expected 
in simple hadronization models. In Sect. 3 we compute the jet fractions for these new 
algorithms to next-to-leading order in perturbation theory and study their properties. 
Sect. 4 is devoted to studies of the mean jet multiplicity, first in resummed perturbation 
theory and then using the HERWIG event generator [8]. Comparisons show clearly the 
benefits of the new algorithms at small values of y cut , beyond the reach of fixed-order 
perturbation theory. Finally, in Sect. 5 we briefly summarize our results and conclusions. 

2 Jet algorithms: defects and cures 
2.1 'Junk-jet' formation 

The basic problem that we have to address is as follows. With decreasing y cut , all jet 
algorithms inevitably start to search for jets amongst the hadrons with low transverse 
momenta and to form spurious 'junk-jets' from them, pretending that these are legitimate, 
resolvable gluon jets. As we shall see below, the original JADE algorithm, with its test 
variable related to invariant mass, accumulates the first junk-jet at y cut ~ X/Q , where 
A ~ 0.5 GeV is a soft scale set by the mean non-perturbative transverse momentum. The 
Durham algorithm eventually falls into the same trap, although at lower y cut . 

One might expect a /cj_-based algorithm to avoid resolving junk-jets as long as y cut 
is kept above X 2 /Q 2 . The Durham algorithm, however, does not respect such a natural 
expectation. According to the clustering procedure adopted, one usually starts from the 
softest particle in a jet (call it hadron $4) and merges it with the one nearest in angle, 
to minimize the relative k±. Thus hadron $4 gets clustered not with the leading hadron 
in the jet but, typically, with the softest among the hadrons which happen to lie on the 
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same side of the jet axis in the transverse plane. Half of them do, on average. The 
typical transverse momentum of the largest junk-jet, and hence the y cut value at which 
it is resolved, is therefore enhanced by a factor proportional to N(Q), the soft hadron 
multiplicity in a jet at scale Q, which is proportional to ln p Q, with p depending on the 
model of soft physics. As a result, junk-jets start to appear at y cut oc ln 2p Q(X/Q) 2 . Taking 
account of fluctuations, such a misinterpretation proves to be a substantial effect. 

To understand these features in more detail, we can examine the action of the al- 
gorithms on simple models of the non-perturbative hadronization process. Consider for 
example the simplest possible string or 'tube' model, in which the hadronization of a 
back-to-back two-parton system yields a distribution of hadrons which is uniform in ra- 
pidity i] and strongly damped in transverse momentum p t , both variables being defined 
with respect to the original parton directions. If the particle density in (f],p tx ,p ty )-space 
is given by p(pt), we have (neglecting hadron masses) 

Q = E cm = J dr]d 2 p t p(p t )p t cosh rj = 2AsinhF (2.1) 

where Y ~ ln(Q/A) is the maximum value of and 

A = J d 2 p t p(p t )p t = N (p t ) /2Y (2.2) 

where the total hadron multiplicity in this model is N ~ (2 A/ (p t )) \n(Q/\). 

Ideally, we would like hadronization to affect the jet structure of events as little as 
possible, and therefore we would prefer the tube to remain classified as a two-jet final state 
down to the smallest possible values of the resolution y cut . Nevertheless any algorithm 
will eventually resolve a third (junk-) jet at some value y cut = y^, the algorithm should 
be designed to make this as small as possible. 

The action of various algorithms in resolving a third jet inside the tube is illustrated 
in Fig. 2. In the JADE algorithm, the resolution measure is related to the invariant mass, 
and therefore jets tend to be resolved as slices of the tube. In particular, an extra third 
jet is resolved when y cut ~ M 2 et /Q 2 , where Mj et is the mass of the jet in either hemisphere, 
given by 

M 2 ct = E 2 ct - p 2 ct = 2A 2 cosh Y ~ AQ , (2.3) 

so that 

(ys) J - ^ • (2-4) 

This is shown by the dashed curve in Fig. 3, to be compared with data from a Monte Carlo 
simulation of the tube model (squares). For this simulation, we simply generated a number 
TV of massless four-momenta with an exponential transverse momentum distribution and 
a uniform rapidity distribution in the interval — Y < rj < Y , Y being given by Eq. (2.1) 
and iV by Eq. (2.2). As illustrative values, we have taken A = 0.5 GeV and (p t ) = 0.3 
GeV. 

In the original Durham (D) algorithm, to resolve a third jet one has to subdivide one 
of the jets into two parts with relative transverse momentum P t , where 

P? ~ y c , t Q 2 ■ (2.5) 
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Figure 2: Resolving a third jet in the final state of the tube model: (a) JADE, (b) 
Durham, (c) angular-ordered Durham algorithm. 

The largest value of y cut at which this can be achieved occurs when one half of the tube 
is divided axially into two half-cylinders, as illustrated in Fig. 2(b), giving 

Pt~ f Y I d 2 p t \p tx \p(Pt) ~ -Ay (2.6) 

JO J 7T 

and hence 

This is shown by the solid curve in Fig. 3, which agrees well enough with the Monte Carlo 
tube model data (circles). We see the expected great improvement relative to the JADE 
algorithm, due to the power-suppression factor of 1/Q 2 rather than 1/Q . However, the 
presence of the log-squared enhancement factor means that the coefficient of 1/Q 2 is far 
larger than O (A 2 ), the order of magnitude that one might hope to be achievable with an 
optimal jet algorithm. 

An alternative way of estimating non-perturbative contributions to (2/3) has been 
proposed in Ref. [15]. At lowest order in perturbation theory, for any infrared-safe jet 
algorithm, this quantity is proportional to a s . In higher orders it is given by a power series 
in a s (Q), where the argument of the coupling is set by the only available hard-scattering 
scale Q = E cm . Now although the perturbative predictions may be expressed in terms of 
a s (Q), one cannot avoid sensitivity to the region of low momenta k <C Q inside integrals 
that contribute to those predictions. This sensitivity makes the perturbation series in 
a s (Q) strongly divergent at high orders, leading to power-behaved ambiguities. 

In the 'dispersive approach' of Ref. [15] these so-called renormalon ambiguities are 
resolved by assuming the existence of a universal low-energy effective strong coupling 
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Figure 3: Mean values of the three-jet resolution in the 'tube' hadronization model, for the 
J (square symbols), D (circles) and A, C (stars) jet clustering schemes. The corresponding 
curves show the approximate formulae discussed in the text. 



a e g(k) with sensible analyticity properties. This leads to genuine power-behaved non- 
perturbative contributions, which may be parametrized in terms of moments of Sa e s(k), 
the discrepancy between the effective coupling and its perturbative expansion in terms of 
a s (Q) (up to the order included in the perturbative contribution). 

As was observed in Ref. [15], the expectation based on the dispersive approach is that 
(1/3) should have a leading non-perturbative contribution proportional to 1/Q in the case 
of the JADE algorithm and proportional to In Q/Q 2 for the Durham algorithm. Thus 
the predicted power behaviour agrees in each case with that expected from the simple 
tube model, although the logarithmic enhancement of the non-perturbative contribution 
for the Durham algorithm is not as large as in the tube model. 

In Sect. 4 we shall present the results of studies of hadronization effects in a more 
realistic model. 
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2.2 Angular-ordered Durham (A) algorithm 



We now show that a simple modification of the Durham algorithm suffices to delay the 
onset of junk-jet formation, which results mainly from a non-optimal sequence of clus- 
tering, rather than from a poor definition of the test variable (as was the case for the 
JADE algorithm). The key to alleviating the problem is to notice that the most general 
definition of a clustering algorithm in fact involves three components rather than the two 
mentioned above, viz. 

0) an ordering variable Vij, 

1) a test variable yij, and 

2) a combination procedure. 

In other words, we now distinguish between the variable v^, used to decide which pair 
of objects to test first, and the variable y^ to be compared with the resolution parameter 
y cut . The algorithm then operates as follows. One considers first the pair of objects (ij) 
with the smallest value of v^. If < y cut , they are combined. Otherwise the pair with 
the next smallest value of V; L j is considered, and so on until either a y^ < y zut is found or, 
if not, clustering has finished and all remaining objects are defined as jets. 

From this viewpoint we see that JADE- type iv^ = y^) algorithms, including the 
Durham variant, are likely to produce the largest number of jets for a given definition of 
the test variable and combination procedure. Suppose that at some stage of clustering 
the pair (ij) with the smallest value of y^ does not have the smallest value of We 
can assume that y^ < y cut , since otherwise clustering has already finished. If the pair (kl) 
with the smallest value of has yM < y cu t, they will be combined. Otherwise, the pair 
with the next smallest value will be considered, and so on until is reached, whereupon 
i and j will be combined. Thus at least the (ij) clustering performed by the JADE-type 
algorithm is available at any stage. However, we cannot make a rigorous argument that 
the JADE-type jet multiplicity is an upper bound on that for v^ ^ y^ algorithms, because 
the sequence of clustering will be different in general. 

Once one separates the functions of the ordering and test variables, it is natural to 
adopt the relative angle for ordering purposes, thus taking into account the angular- 
ordering property of QCD [16]. Recall that angular ordering corresponds to the fact 
that a soft gluon emitted at some angle to the jet axis cannot resolve the colours of jet 
constituents at smaller angles. For a correct correspondence with the colour structure of 
the jet, one should therefore combine the constituents at smaller angles first. We shall 
refer to this choice of ordering variable, combined with the Durham test variable and 
combination procedure, as the angular- ordered Durham (A) algorithm. One starts with 
a table of the energies E { of primary objects and their relative angles 6^, or equivalently 
the quantities 



all defined in the e + e cm. frame. The A algorithm is then as follows: 

Step 1. Select the pair of objects (ij) with the minimal value of the ordering variable, 



= 2(1 - cos%) , 
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Step 2. Inspect the value of the test variable, 



Uij = mm{Ei, Ej} 2 Vij . 

If i/ij < y cut , then update the table by deleting i and j, introducing a new particle 
(ij) with 4- momentum pij = Pi + pj, and recomputing the relevant values of the 
ordering variable; then go to Step 1. 

If Uij > Vent, then consider the pair (if any) with the next smallest value of the 
ordering variable and repeat Step 2. If no such pair exists, then clustering is finished 
and all remaining objects are jets. 

The earlier discussion of the logarithmic enhancement of non-perturbative effects in 
the Durham algorithm showed that it arises because spurious junk-jets can be formed 
by combining soft, large- angle (low-rapidity) hadronization products. The A algorithm 
avoids this by choosing small-angle combinations preferentially. Then the two real jets in 
the final state of the tube model are built up from the high-rapidity ends of the tube, and 
the first junk-jet to be resolved will be the wide-angle hadron or cluster with the largest 
transverse momentum, 

/ \A \Pt,max) ,~ _s 

(ys) ~ g 2 , (2-9) 



as represented schematically in Fig. 2(c). The relationship between ^ max y and (p t ) 
depends on the multiplicity N and the form of the ^-distribution. For an exponential 
Pi-distribution we have 

{pLJ-((Pt)^N) 2 (2.10) 



and therefore 

A 



(Vs) A - (^lnln(Q/A)j . (2.11) 

This is shown by the dot-dashed curve in Fig. 3, which again agrees quite well with the 
Monte Carlo data (stars). 



2.3 Mis-clustering 

It is difficult to see how the value of (y 3 ) could be reduced much further by refinement 
of the A algorithm, since fluctuations in p t will always lead to a multiplicity-dependent 
enhancement of (pt, m ax) relative to the intrinsic non-perturbative scale set by (pt) NP - There 
is however a remaining weakness of the algorithm that, while having no perceptible effect 
on (1/3), can be expected to show up more in studies of multi-jet rates and the internal 
structure of jets. It has to do with the wrong assignment of soft wide-angle radiation to 
already resolved jets. 

Consider the parton-level application of the algorithm to the basic splitting q 1 — > qi+g 2 
accompanied by a soft large-angle gluon g 3 (Fig. 4), such that 

E 1 > E 2 > E 3 , 12 < 13 w 9 23 . 
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Figure 4: Parton branching with soft, large-angle gluon emission. 

According to the angular ordering property of QCD, the soft gluon g 3 is radiated 
coherently and the corresponding radiation intensity is proportional to the total colour 
charge of the relatively narrow system qi + g 2 , which is just that of the initial quark. 
However, in half of the events the A (or D) algorithm will erroneously assign to the 
gluon jet #2 because the latter happens to lie a little closer in angle to g$ than the quark. 

At first sight, this does not seem to do any harm. The three partons are going to 
be identified as a 1-, 2-, or 3-jet system depending on the chosen value of y cut , and the 
corresponding probabilities of the 1-, 2-, 3-jet configurations remain unaffected by mis- 
clustering. This is true as long as one is concerned only with the jet fractions and not 
with the internal structure of the jets. However, after having resolved this system as two 
jets at some resolution y\ one may attempt to choose another yet smaller y 2 <C y± and 
to study the internal sub-jet structure by examining the history of merging. In this case 
the results will be misleading because one will find that the gluon sub-jet is artificially 
overpopulated. The same mis-clustering will occur when all three partons in our example 
are gluons, although in this case the practical damage may be less pronounced. 

2.4 Cambridge (C) algorithm 

The modification we propose in order to reduce the mis-clustering effect consists of 'freez- 
ing' the softer of two resolved objects to prevent it from attracting any extra partners 
from among the remaining objects with larger emission angles. The corresponding Cam- 
bridge (C) algorithm is thus defined as follows. As before, one starts with a table of the 
energies E-i of primary objects and their relative angles as given by the ordering variable 
v i:j = 2(1 - cos 6y. 

Step 0. If only one object remains in the table, then store this as a jet and stop. 
Step 1. Otherwise, select the pair of objects (ij) having the minimal value of the ordering 
variable, Vij. Order the pair such that E-i < Ej. 
Step 2. Inspect the value of the test variable, 

Vij = E i r U ■ 

• If yij < y cut , then update the table by deleting i and j, introducing a new particle 
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(ij) with 4- momentum p^ = Pi + Pj, and recomputing the relevant values of the 
ordering variable. 

• If Uij > Ucut, then store i as a jet and delete it from the table. 
Step 3. Go to Step 0. 

This procedure suffices to prevent mis-clustering in the dominant region of phase space 
where, in the above example, the gluon g 2 is softer than the quark q\. It will still cluster 
the partons incorrectly in the subleading configuration where the gluon (call it g± now 
since E\ > E 2 ) is accidentally harder than the quark (g 2 ). After having resolved them, 
it will erroneously assign the soft gluon g 3 to g 1 (the more energetic one), spoiling the 
association between the clustering and the colour factor. A similar mistake will occur in 
the case of a resolved g — > qiq 2 splitting plus a soft wide-angle gluon g 3 : the algorithm 
will inevitably cluster g 3 with the quark or the antiquark, which is incorrect because the 
source of g 3 is the coherent octet colour of the pair, not a triplet. However, the fraction of 
events of these types should be small, and it is hard to see how such occasional mistakes 
can be avoided without distinguishing between different species of partons. 

As already mentioned, the additional 'soft freezing' step in the Cambridge algorithm 
does not affect the quantity (y 3 ) significantly. This is shown by Fig. 3, where the points 
for the A and C algorithms coincide within the plotting resolution. 

We do, however, expect the internal jet structure resulting from the A and C algo- 
rithms to be different. The aim of the freezing is to prevent soft jets from acquiring 
particles that do not belong to them. A good diagnostic quantity for this is the mean 
number of particles in the first junk-jet to be resolved in the tube model, i.e. the third 
(softest) jet when y cut = y 3 . In the optimal algorithm this quantity, (n 3 ), should be as 
small as possible. 

Our results on (n 3 ) for the various algorithms are presented in Fig. 5. As could be 
guessed from Fig. 2, the values in the JADE and original Durham algorithms are quite 
similar, both being somewhat below one quarter of the total multiplicity N. As shown by 
the curve, {n 3 ) D is in fact about 0.22iV, with (n 3 ) J rising slightly less rapidly. Therefore 
in this respect the unmodified Durham algorithm performs slightly worse than JADE 
asymptotically. The angular-ordered Durham algorithm performs a good deal better, but 
the population of the junk-jet still grows steadily with increasing energy. Introducing 

the soft freezing step as well (C algorithm) kills the growth of the spurious jet almost 

c 

completely, and the value of {n 3 ) remains near 2 at all energies. 

Taking Figs. 3 and 5 together, we see that the Cambridge algorithm performs best 
in limiting both the amount of junk-jet formation and the growth of junk-jets once they 
have formed. 

In real hard processes like e + e~ annihilation, jet algorithms have to operate on final 
states containing both genuine hard jets and spurious junk-jets, and should ideally sup- 
press the latter as far as possible while remaining sensitive to the former. We shall study 
these issues in more depth in later Sections, but a few qualitative observations can be 
made here. First, we would expect the angular ordering in the A and C algorithms to 
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Figure 5: Mean number of particles in the third jet at y cut = y% in the 'tube' hadronization 
model, for the J (square symbols), D (circles), A (stars) and C (diamonds) jet clustering 
schemes. The curve shows 0.22 times the total number of particles. 



reduce the number of jets at a given value of y cut , relative to the D algorithm, because, 
as remarked earlier, separating the ordering and test variables generally increases the 
number of combinations attempted. This effect should become more pronounced as the 
number of objects increases, i.e. with increasing energy and/or decreasing y cut . On the 
other hand, the soft freezing step in the C algorithm tends to enhance the number of 
jets, relative to the A algorithm, by forbidding combination with frozen jets. Again this 
difference should be more pronounced for a larger number of objects. 

It should be emphasised that neither the angular-ordering nor the soft freezing mod- 
ifications affect the good properties of the Durham algorithm with respect to the resum- 
mation of large logarithmic terms at small values of y cut . The leading and next-to- leading 
logarithms of y cut come from kinematic regions where the sequence of clustering is not 
affected by the extra steps in the modified algorithms. Thus the only adjustment to re- 
summed predictions will be in the subleading 'remainder' terms, as will be discussed in 
more detail in Sec. 4. 
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In terms of computing, the modified algorithms are slightly less convenient than the 
original JADE-type (including Durham) ones, because of the new distinction between 
the ordering and test variables. In a JADE-type algorithm, the sequence of clustering is 
independent of the value of y cut . Therefore the clustering can be done once and for all, the 
sequence can be stored, and all questions about jet multiplicity and substructure can be 
answered subsequently without much further computation. This is the strategy used in 
the KTCLUS package [17]. Once the ordering and test variables are separated, however, the 
clustering sequence depends in general on the value of y cut and all computations involve 
a complete reclustering up to that value. 

For example, in the original Durham algorithm the value of y 3 for a given event is 
defined precisely by the point in the clustering sequence at which three objects become 
two, independently of y cut . In the modified algorithms, the calculation of y 3 involves 
repeatedly reclustering the event with different values of y cut , until the value at which three 
objects become two is found with adequate precision, e.g. by a binary search procedure. 
The extra computation might have caused difficulties when the JADE algorithm was 
originally proposed, but advances in computing over the past ten years mean it is no 
longer a problem. 



3 Fixed-order results 

It is common in the specialized literature to define the n-jet fraction f n (y) by 2 

U(«) = =^ = — , (3.1) 

where y is the jet resolution parameter. If <7 to t identifies the total hadronic cross section 
Ctot — °o(l + « s / 7r + •••)> a o being the lowest-order Born cross section, then the constraint 
E»/»(y) = 1 applies. 

For the choice /i — Q — E cm of the renormalization scale, one can conveniently write 
the n-jet fractions (e.g., for the cases n = 2, 3 and 4) in the following form [10]: 

fM = 1 - (g) A(y) + (2A(y) - B(y) - C(y)) + (3.2) 

My) = (£) A{y) + (£) 2 {B{V) " 2A{y)) + (3 ' 3) 



where the coupling constant a s and the functions A(y),B(y) and C(y) are defined in 
some renormalization scheme (in what follows we shall make use of the MS scheme). The 
terms of order O (a|) involving A(y) take account of the normalization to <7 to t rather than 

to (7q. 



2 From now on, in order to simplify the notation, we shall often use y to represent y cut . 
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Figure 6: The parton level B(y) function entering in the two- and three-jet fractions (3.2)- 
(3.3) at NLO in the D (continuous line), A (dashed line) and C (dotted line) schemes. 

For the original Durham jet algorithm, several studies were carried out in Ref. [10], 
together with a parametrization of the auxiliary functions A(y),B(y) and C(y). There, 
analytic fits of the form 

F(y) = j2kJ In-) (3.5) 
n=0 V yJ 

were performed for F = A,B and C. The coefficients k n were tabulated for values of y 
in the range 0.01-0.33 in the case of A(y) and B(y), whereas for C(y) the interval was 
0.01-0.1. 

In this Section we present results similar to those of Ref. [10] for the case of the 
two modifications of the original Durham clustering scheme that we are proposing, the 
'angular-ordered Durham' and 'Cambridge' schemes, and we compare the performances 
of all three algorithms in predicting several results of phenomenological relevance in QCD 
analyses. In particular, in order to match the advances on the experimental side in 
resolving jets at very small values of y, in our studies we have extended the range of the 
jet resolution parameter down to the minimum figure of 0.001. However, in this respect, 
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we have to stress that at very small values of y (i.e., around 0.001) the perturbative 
calculations become unreliable, because of the growing five-jet contribution, which of 
course does not appear in our O (ctg) computation. In other words, for small y's, terms 
of the form O (a™ ln m y) with m < 2n become large and have to be resummed [18] 3 . In 
producing the numerical results presented in this Section we have made use of the Monte 
Carlo (MC) program EERAD [22], which implements the complete next-to-leading-order 
(NLO) corrections to the e + e~ — > 2 jet and e+e~ — > 3 jet rates, as well as the leading- 
order (LO) contributions to e + e~ — > 4 jets. For the purpose of illustration, the total 
cm. energy Q is always chosen to be the mass of the Z-boson and no QED initial-state- 
radiation (ISR) has been included. The numerical values are M z = 91.17 GeV for the 
mass and Y z = 2.516 GeV for the width (as we use the e + e~ — > Z, 7* — > . . . annihilation 
rates). 




y 



Figure 7: The parton level C(y) function entering in the four-jet fraction (3.4) at LO in 
the D, A (continuous line) and C (dashed line) schemes. 

Up to the order O (a|), differences in the jet fractions (3.2)-(3.4) as predicted by the 

3 We look forward to the availability in numerical form (see, e.g., Ref. [19]) of the complete O (a|) 
results, toward which contributions have been made by two different groups [20, 21]. 
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three jet algorithms D, A and C can only occur when calculating the NLO contributions 
to the two- and three-jet rates and the LO ones to the four-jet cross section. More 
precisely, only the functions B(y) and C(y) can vary depending on the scheme adopted. 
In particular, B(y) turns out to be different for all three jet algorithms, whereas C(y) 
is the same for the D and A schemes but differs for the C scheme. This can easily be 
understood if one recalls that, in comparison to the D algorithm, the A algorithm only 
involves a modification in the combination scheme (i.e., the 'angular ordering') used in 
the n-parton — > (n — m)-jet transitions (with n > 4 and 1 < m < n — 1), whereas the 
C algorithm also affects the n-parton — > n-jet rates for n > 4 (i.e., via the 'soft freezing' 
procedure). Indeed, note that for n = 3 partons, kinematical constraints impose that, 
on the one hand, the two closest particles are also those for which y is minimal and, on 
the other hand, the identification of the softest of the three partons as a jet implies that 
the remaining two particles are naturally the most energetic and far apart. This clearly 
implies that the A(y) function is the same for all three algorithms. For the above reasons 
then, we only plot the distributions for the B(y) and C{y) functions. This is done in 
Figs. 6 and 7, respectively. 

+ - 

e e -> 5 jets at LO 
12500 rn 1 1 1 1 1 — i — i — i — i 1 




0.02 



Figure 8: The parton level E(y) function entering in the five-jet fraction at LO in the D, 
A (continuous line) and C (dashed line) schemes. 



15 



e e -> 2,3,4 jets 



a 

o 

o 

Sh 



1.0 



0.8 



0.6 



0.4 



0.2 



0.0 





1 


i i i i i 

- 


I I I I I 

^^^^ 


1 

2jet 




















// 












// 












// 












// 












// 












// 












// 
// 










// 












// 












/ / 


x\ 










// 


^\ 










A/ 
















3jet 








i 


' ~ " 1 


J I 1 I 1 


i i 1 





0.001 



0.005 0.01 



0.05 



0.1 



Figure 9: The O (a^) two-, three- and four-jet fractions at parton level in the D (contin- 
uous lines), A (dashed lines) and C (dotted lines) schemes. 



Whereas at LO all the contributions to the three-jet cross section come from the parton 
process e + e~ — > qqg, at NLO contributions to the three-jet rates are due to two sources. 
First, the real emission diagrams for the processes e + e~ — > qqgg and e + e~ — > qqQQ, 
in which one of the partons is unresolved. This can happen when one has either two 
collinear partons within one jet or one soft parton outside the jet cones. Both these 
contributions are (in general, positively) divergent. Thanks to the Bloch-Nordsieck [23] 
and Kinoshita-Lee-Nauenberg [24] theorems (see also Ref. [25]), these collinear and soft 
singularities are cancelled at the same order in a s by the divergent contributions (generally 
negative) provided by the second source, namely the virtual loop graphs. Therefore, after 
renormalizing the coupling constant a s a finite three-jet cross section is obtained. The 
function B(y) accounts for the above-mentioned three- and four-parton contributions. 
The LO four-jet cross section at a given y-value is due to four-parton events for which 
all the measures y^- are greater than the resolution parameter. These contributions are 
contained in the C(y) term. 

From Fig. 6, one can appreciate that, whereas differences between the two proposed 
modifications of the Durham algorithm are negligible, those between these two and the 
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original one can amount to several tens of percent in a sizable part of the experimentally 
relevant y-range, especially for y ~ 0.01. Let us then investigate their behaviour. It is clear 
that B(y) for the A and C algorithms is consistently smaller (both in magnitude and sign) 
than for the D one. This turns out to produce a smaller three-jet fraction, compare to 
Eq. (3.3), where the B(y) term enters with a positive sign (the leading piece proportional 
to A(y) clearly dominates). This is rather natural since, as discussed in Sect. 2.2, the 
D algorithm represents a sort of upper bound in jet multiplicity to its variation A. The 
'angular ordering' procedure tends to combine more pairs of partons than the original D 
clustering does, as the former would pair the two closest particles even when their y- value 
is not the smallest of the entire event. If these two partons yield the minimum y, then the 
two combination schemes coincide. Conversely, this inevitably produces an enhancement 
of the lowest order jet fraction, that is the two-jet one, see Eq. (3.2), where B(y) is indeed 
preceded by a minus sign and the C(y) term is unaffected by the A prescriptions. The 
additional step of 'soft freezing' implemented in the C algorithm tends to enhance the final 
jet multiplicity of the original D scheme, by preventing the softer particle in a resolved jet 
pair from attracting the remaining particles (at large angle) into unresolved parton pairs, 
which would then be merged together, producing a lower number of final jets. In this 
respect, therefore, the yield of the clustering procedure of the original D algorithm acts, in 
terms of final jet multiplicity, as a lower bound on that produced by the implementation 
of the 'soft freezing' step. This can be seen from Fig. 7, for the auxiliary function C(y) 
to which the four-jet rate (3.4) is proportional, though the effect remains at the level of 
2 — 4% over the considered range in y. The increase of the four-jet rate is compensated for 
by a further decrease (with respect to the A scheme) of the C-scheme three-jet fraction, 
as can be seen in the B(y) term from Fig. 6 and Eq. (3.3). However, contrary to the 
'angular ordering' case, when 'soft freezing' is adopted the two-jet rates are much less 
affected, as both the two terms B(y) and C(y) vary and in opposite directions. 

As an additional exercise, in order to show that, as previously discussed, the effects 
of 'soft freezing' do become more evident as the final state multiplicity increases, we plot 
in Fig. 8 the (a s /27r) 3 coefficient of the five-jet fraction at LO (which we call E(y)), as 
obtained from the programs already employed in Refs. [26, 27]. Here, differences between, 
on the one hand, the D and A schemes and, on the other hand, the C one, can be quite 
large, more than twice those obtained in the case of the C(y) function (compare to Fig. 7). 
For example, at y — 0.001 these amount to 6% and become relatively larger with y. At 
the upper end of the interval considered in Fig. 8, y — 0.02, they are at the level of 15%. 

In summary, at the order O (a|), one could conclude the following: whereas 'angular 
ordering' tends to shift a part of the three-jet rate into the two-jet fraction without 
affecting the four-jet cross section, 'soft freezing' enhances the fraction of four-jet events 
and reduces the three-jet one, with a small effect on the two-jet rate. Note that the A 
scheme only implements the first step, so that the corresponding effect is more promptly 
visible. In contrast, as the C scheme adopts both these procedures, in this case the two 
effects tend to mix together in the final rates. However, in this case one expects the 
'angular ordering' effects to be dominant over those due to the 'soft freezing', as can be 
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Figure 10: The parton level three-jet fraction at NLO as a function of the renormalization 
scale /i at y — 0.005 and y = 0.050, for the D (continuous lines), A (dashed lines) and C 
(dotted lines) schemes. 



argued by comparing Figs. 6 and 7. These effects are summarised in Fig. 9, where the 
two-, three- and four-jet fractions as defined via eqs. (3.2)-(3.4) are plotted. As we do 
not possess the complete O (a|) result, note that we are unable to present similar results 
for the five-jet fraction. The value adopted in Fig. 9 for a s is 0.120, as given in Ref. [28]. 

In the remainder of this Section we focus our attention on the three-jet fraction. 
In particular, we would like to point out that, since the algorithms D, A and C have 
identical three-jet rates at LO, it is evident from Fig. 9 and Eq. (3.3) that the NLO 
corrections to fz{y) contained in the coefficient function B(y) are significantly smaller 
in the two new algorithms. From the point of view of the convergence of the QCD 
asymptotic expansion, and hence of the reliability of the theoretical predictions, this 
should represent an improvement. In the sense that, provided one adopts a suitable value 
of the scale parameter \x regulating the corrections (see Ref. [29] for discussions of the best 
choice of the subtraction scale), it might not be unreasonable to argue that jet algorithms 
having smaller NLO terms may also have smaller higher-order corrections. Clearly, this 



18 



can be assessed only when the latter are computed. However, some hints on this point 
can be deduced from studying the scale dependence of the complete NLO results. In fact, 
we know that the O (ctg) corrections are guaranteed to cancel the /i-dependence of the 
O (ag) three-jet fraction up to the order O (cxf). 

The /i-dependence of the three-jet rate is simply introduced in Eq. (3.3) by the two 
substitutions 

a s - aM, B(y) - B(y) - A(y)(3 In ^, (3.6) 

where @ is the first coefficient of the QCD /3-function, that is fio — 11 — 2iV//3, with Nf 
the number of flavours active at the energy scale Our results are shown in Fig. 10, 
where the value of fz{y) is plotted as a function of the scale parameter fi/Q = fi/Mz, 
over the range between 1/10 and 2, at two fixed values of the jet resolution parameter. 
Note that for the strong coupling constant we have used the two-loop expression, with 
Nf = 5 active flavours and A^- = 250 MeV (yielding a s (M z ) = 0.120, to match the rates 
given in Fig. 9). 

Although the structure of the QCD perturbative expansion does not prescribe which 
value should be adopted for the scale /x, an obvious requirement is that it should be of the 
order of the energy scale involved in the problem (i.e., the cm. energy Q). Such a choice 
prevents the appearance of large terms of the form (a s \n(fi/Q)) n in the perturbative 
series. Therefore, it is reassuring to recognise that in Fig. 10 the A and C curves are more 
stable than the D ones around the value fi/Q = 1, especially at smaller y- values, where 
the differences in fs(y) among the three schemes are larger. For example, over the range 
1/2 < ji/Q < 2 the variation of f%{y) between its maximum and minimum values at 
y = 0.005 is sa 2.7% for the D scheme and w 1.3% for A and C. For y = 0.050 differences 
between the three algorithms are smaller, as fz{y) varies by ps 10% in the first scheme 
and ?s 8.7% in the other two. 
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Table 1: Parametrization of the three- and four-jet QCD functions A, B and C as polyno- 
mials X)n ^n(ln(l/|/)) n , for the Durham algorithm and its variants. The range of validity 
in y is given for each case. 

We conclude this Section by presenting a polynomial fit of the form (3.5) to the A, B 
and C functions in the D, A and C schemes. The range of validity of our parametrization 
extends from y = 0.001 (for all three auxiliary functions) up to y = 0.2 for A and B and 
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y — 0.1 for C. This should allow for an easy comparison between data and perturbative 
predictions over the y-range normally adopted in phenomenological studies (see, e.g., 
Ref. [14]). The values of the coefficients k n , with n — 0, ...4, are reported in Tab. I. 



4 Mean number of jets 

In Sect. 2 we studied the mean value of 2/3, the value of y cut at which three jets are 
just resolved, as a measure of jet algorithm performance, using a simple 'tube' model to 
estimate hadronization effects. That model had no perturbative contribution to y 3 , and 
so we could explore the region of low y cut simply by increasing the energy (see Fig. 3). In 
the presence of perturbative contributions, (ys) is not so small (about 0.02 at Q = Mz). 
Therefore for more realistic investigation of the interplay between perturbative and non- 
perturbative effects, we choose here to investigate a different quantity, namely the mean 
number of jets, n jcts . This may be written in terms of the individual n-jet fractions (3.1) 
as 

00 

n iats ^U{y) = Y j nf n {y). (4.1) 

71=1 

The mean number of jets has the advantage that it can be studied as a function of 
the jet resolution y, down to arbitrarily low values, at fixed energy. Furthermore, its 
perturbative behaviour at very low values of y can be computed including resummation 
of leading and next-to-leading logarithmic terms to all orders [13]. This behaviour can be 
matched with the fixed-order results of the previous section, to give reliable predictions 
(apart from non-perturbative contributions) throughout the whole range of y cut . Non- 
perturbative contributions can then be estimated by comparing the perturbative results 
with those of a Monte Carlo event generator such as HERWIG [8]. 

4.1 Resummed predictions 

Using the notation of the previous Section, we have from Eqs. (3.2)-(3.4) 

U{y) = 2 + (£) My) + £) 2 {B{y) + 2C{y) - 2A{y)) + ••■ • (4 - 2) 

The behaviour of the first-order coefficient A(y) at small y is of the form 

A(y) = C F (\n 2 y + 3\ny + r(y)) , (4.3) 
where Cf = 4/3 and the non-logarithmic contribution is [5, 6] 

r(y) = 6 In 2 + ^- ^ + 4 (ln(l + y/2) - 2^2) rfj - 3.7y lny + O (y) . (4.4) 

For the second-order coefficient the expected behaviour, for all three versions of the 
algorithm, is 



F{y) = B{y)+2C{y)-2A{y)=C F 



1 C A In 4 y - i (C A - N f ) \n 3 y + (in 2 y) 



12 



, (4.5) 
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where Ca = 3. The terms of order In 2 y depend in general on the version of the algorithm 
(D, A or C). To find them, we made a fit of the form (3.5), restricted now to the region 
0.001 < y < 0.02, with the coefficients k 3 and k± fixed at the values prescribed by 
Eq. (4.5) and ko, fci, &2 as free parameters. The results are given in Tab. 2. Using these 
fits for y < 0.01 and those in Tab. 1 for 0.01 < y < 0.1, we obtain simple, smooth 
parametrizations of the second-order coefficient F(y) over the whole range of y < 0.1. 
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Table 2: Parametrization of the second-order coefficient in the average number of jets as 
a polynomial J2 n kni) 10 -^ / y)) n , for the Durham algorithm and its variants. The range of 
validity is y < 0.02. 

To obtain resummed perturbative predictions for the mean number of jets, we now 
proceed as in Ref. [13]. To next-to-leading logarithmic (NLL) accuracy, the resummed 
results are independent of the version of the algorithm. Therefore the only modifications 
to the predictions for the original D algorithm come from the matching to the fixed-order 
results given above. We simply subtract the first- and second-order terms of the NLL 
resummed result and substitute the corresponding exact terms. Denoting by M q the NLL 
multiplicity in a quark jet, given in [13], we obtain 

A%) = 2M q {y) + C F (g) r(y) + (g)' (F(y) - 2F q (y)) (4-6) 
where F q is the second-order coefficient in Af q , given in [30]: 

F M = C F {±C. 4 y - L (Ca - N,) rt v + % (x - g) [( 4 g - l) g - l] ]n> ,} 

(4.7) 

The resulting predictions for the three algorithms, using a s = 0.12 as in the previous 
Section, are shown in Fig. 11. 

4.2 Monte Carlo studies 

We have used the Monte Carlo event generator HERWIG [8] (version 5.9 [31]) to study 
non-perturbative corrections to the above parton-level predictions. The program gener- 
ates a shower by coherent branching of the partons involved down to a fixed transverse 
momentum scale ~ 1 GeV, and then converts these partons into hadrons using a cluster 
hadronization model. The jet algorithms can be run both on the results of the parton 
shower and on the final-state hadrons: the difference between these is customarily used 
as an estimate of the hadronization correction. 

Fig. 12 shows the results for various jet algorithms at Q — Mz- On the right are 
the Durham algorithm and the two variants of it proposed in this paper, with angular 
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Figure 11: Resummed predictions for the mean number of jets at parton level in the D 
(continuous lines), A (dashed lines) and C (dotted lines) schemes. 



ordering (A) and with angular ordering and soft freezing (C). On the left, for comparison, 
are the JADE algorithm and its corresponding variants, in which the same ordering and 
freezing steps are introduced, but the JADE test variable (1.1) replaces (1.3). 

Since the aim was to study the specific process e + e~ — > hadrons at a scale equal to the 
full centre-of-mass energy, HERWIG was set not to produce any initial-state radiation 
(ZMXISR=0). Each data point was calculated using 2000 events. A different pair of random 
number seeds was used for each value of y cut in order for the results to be uncorrelated. 
The statistical errors are smaller than the size of the points. 

At the parton level, the effects of the modifications to the basic JADE and Durham 
algorithms are scarcely visible in Fig. 12. Looking in more detail at the Durham re- 
sults in comparison with the resummed predictions (Figs. 13-15), we see that in each 
case HERWIG at the parton level (squares) somewhat overestimates the number of jets, 
corresponding to a larger effective value of a s . The dot-dashed curves correspond to 
a s = 0.120, as in Fig. 11, whereas the dashed ones, which lie close to the Monte Carlo 
results, are for a s = 0.126. The value a s = 0.114 (dotted) corresponds to interpreting the 
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HERWIG input parameter QCDLAM, for which we used the default value of 0.18 GeV, as 
the NLO scale parameter A^. Such an interpretation is only justified in a small region 
of phase space (see [32]) which we would not expect to be dominant, so a larger effective 
value is not surprising. 

Turning to the hadron-level results, we see that the differences between the various 
algorithms are much more evident than at parton level. In particular, the jet multiplicity 
is reduced at y cut ~ 10~ 2 in the A and C algorithms, resulting in substantially smaller 
hadronization corrections for y cut ~ 10~ 3 . At larger values, the hadron-level points even 
fall below the parton-level ones, suggesting a negative hadronization effect or possibly a 
positive non-perturbative contribution at the HERWIG parton level, which could account 
for the apparently high effective a s . The presence of cut-offs and kinematic boundaries in 
the parton shower could indeed give rise to non-perturbative parton-level contributions. 
This illustrates the potential danger of interpreting the hadron-parton difference as an 
estimate of non-perturbative effects and simply adding it to the resummed prediction. In 
the case of the A and C algorithms, our results suggest this would lead to an overestimate 
of as- 
Comparing the hadron-level results in Fig. 12, it can be seen that the angular order- 
ing criterion alone is not enough to guarantee low hadronization corrections: it is also 
necessary to use the Durham test variable. In the 'seagull diagram' (Fig. 1) there is a 
significant fraction of phase space where the JADE test variable for the two gluons is less 
than y cut , whereas those for either gluon and its associated quark are both greater than 
y cut . In this region it will still be the two gluons that are combined even though this is 
the last of the three possibilities to be considered. As a consequence, the angular-ordered 
JADE algorithm still leads to 'phantom jets' in regions where there are no particles, 
enhancing the jet multiplicity and hadronization effects. 

It is interesting to see that the angular-ordered JADE algorithm with soft freezing 
performs even worse in this context than its equivalent without freezing. This is because 
the freezing step increases the mean number of jets at the hadron level whilst having little 
effect at the parton level. 

Fig. 16 shows the equivalent results at a typical LEP2 energy of 172 GeV. It is useful to 
reduce the hadronization corrections at this energy since that may help in discriminating 
between the process e + e~ — > WW — > 4 jets, used to measure the W mass, and the 
background from e + e~ — > Z /'-/ — > 4 jets. Here the results have been continued down to 
y cut = 2 x 10~ 5 , the point at which non-perturbative corrections might be expected to set 
in at this energy. 

The results at 172 GeV are similar to those at 90 GeV, except that they exhibit a 
lower multiplicity for all six algorithms at a given y cut , owing to the running of a s , and 
that the parton- and hadron-level curves diverge at a smaller value of y cut , owing to the 
power-decrease in the hadronization corrections. 
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5 Discussion and conclusions 

In this paper we have identified some deficiencies of the Durham (D) jet clustering al- 
gorithm and their causes. They are due mainly to the tendency of the algorithm to 
assemble 'junk-jets' from soft, wide-angle hadrons or partons, and to mis-cluster particles 
into already-resolved soft, wide-angle jets. To suppress junk-jet formation, we propose 
the separation of the ordering and resolution testing steps of the algorithm. Using the 
relative angle as the ordering variable and the /cj_-resolution as the test variable, we ob- 
tain the angular-ordered Durham (A) algorithm. To reduce mis-clustering, we propose 
'soft freezing', i.e. forbidding further clustering with the softer jet when a pair of jets are 
resolvable but have the smallest angle. Together with angular ordering, this gives the 
Cambridge (C) algorithm. 

As measures of performance of jet algorithms, we studied two quantities in a simple 
'tube' model of hadronization: (y 3 ), the mean value of the resolution at which three jets 
are just resolved, and (n 3 ), the mean multiplicity in the just-resolved third jet. Since 
the tube model has no real multijet production, in an optimal algorithm these quantities 
should both be as small as possible. We found that (y 3 ) decreases much more rapidly 
with increasing energy in the Durham algorithm than in the original JADE scheme. 
Nevertheless the Durham value is enhanced by junk-jet formation, and (1/3) is therefore 
reduced significantly by the angular-ordering step in the A and C schemes. The quantity 
(713), on the other hand, is quite similar in the JADE and Durham schemes, smaller 
but still increasing in the A scheme, and essentially flat in the C scheme. Thus the two 
measures provide neatly complementary evidence of the benefits of the two new features 
of the C algorithm. 

Next we presented full NLO perturbative calculations of the jet fractions in the new 
A and C algorithms, and compared the results with those for the original D scheme. 
The main difference is that the angular ordering in the new algorithms shifts a part 
of the three-jet rate into the two-jet fraction, while the extra 'soft freezing' in the C 
scheme moves some of the three-jet rate into the four-jet one. This is understandable 
because angular ordering tends to permit more clustering, while soft freezing prevents 
some clustering. The beneficial results include a significant reduction in the sensitivity 
to the renormalization scale. 

We also extended the parametrization of the fixed-order results for the D scheme in 
Ref. [10] down to y cut = 1CT 3 and provided similar parametrizations for the A and C 
schemes. 

For a more realistic estimate of hadronization effects than that obtained from the 
tube model, we investigated the mean number of jets as a function of jet resolution in 
the region of low y cnt . We combined the fixed-order results for this quantity with those 
obtained by resummation of leading and next-to-leading logarithms of y cut to all orders, 
thereby obtaining reliable perturbative predictions for all y cut < 10" 1 . The resummed 
terms are the same for the D, A and C algorithms; only the subleading logarithms and 
non-logarithmic parts had to be fitted differently in each case. This extended the results 
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of Ref. [13] for the jet multiplicity in the D scheme to include the A and C schemes. 

Finally, we used the Monte Carlo program HERWIG to generate simulated e + e~ 
annihilation events, and compared the parton- and hadron-level jet multiplicities for the 
Durham algorithm and its proposed modifications with the perturbative predictions and 
with each other. We also investigated the equivalent modifications to the JADE algorithm, 
but these either showed no benefit or performed worse than the original. 

We found that the mean number of jets at the HERWIG parton level was similar in 
the D, A and C schemes, and close to that predicted perturbatively, albeit for a rather 
high effective value of a s . The HERWIG hadron-level values were more different, being 
lowest for the A scheme, slightly higher for the C scheme, and much higher for the original 
D scheme at small y cut . Down to y cut ~ 1CT 3 at E cm = Mz (10~ 4 at E cm = 172 GeV), 
the C-scheme hadron-level values lie close to the perturbative ones, if the HERWIG 
input parameter QCDLAM is interpreted as the QCD scale parameter A^-. In addition, 
the hadronization effects are greatly reduced below y cut ~ 10~ 3 , suggesting that the C 
algorithm will be particularly useful for exploring the interface between perturbative and 
non-perturbative dynamics [12]. 

A Fortran subroutine to perform jet clustering according to the C algorithm, CAMJET, 
may be obtained via the World-Wide Web at 

http : //www . hep . phy . cam . ac . uk/theory/webber/ camj et/camj et . html. 
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Figure 12: Parton-level (squares) and hadron-level (circles) results from HERWIG on 
mean number of jets at Q — Mz for various jet algorithms as labelled. 
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Figure 13: Results on the mean number of jets at Q — M z for the original Durham 
algorithm. HERWIG: squares, parton level; circles, hadron level. Resummed: dashed, 
a s = 0.126; dot-dashed, a s = 0.120; dotted, a s = 0.114. 
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Figure 14: Results on the mean number of jets at Q — Mz for the angular-ordered 
Durham algorithm. Points and curves as in Fig. 13. 
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Figure 15: Results on the mean number of jets at Q — Mz for the Cambridge algorithm. 
Points and curves as in Fig. 13. 
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Figure 16: Parton- and hadron-level HERWIG results on the mean number of jets, as 
Fig. 12 but at Q = 172 GeV. 
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